Y - Randomization – A Useful Tool in QSAR Validation , or Folklore ?
نویسندگان
چکیده
Several variants of randomization procedures were compared as a tool in validation of multilinear regression (MLR) QSAR equations that are obtained by descriptor selection. Y-randomization, a method formerly said to be probably the most powerful validation procedure, was found to be overoptimistic. The statistical significance of a new MLR QSAR model should be checked by comparing its measure of fit to the average measure of fit of best random pseudomodels that are obtained using random pseudodescriptors instead of the original descriptors and applying descriptor selection as in building the original model. Application of this criterion to several recently published MLR QSAR equations identified dubious ones. Some progress also is reported towards the goal of obtaining the mean best r of random pseudomodels by calculation rather than by tedious multiple simulations on random number variables.
منابع مشابه
y-Randomization and Its Variants in QSPR/QSAR
y-Randomization is a tool used in validation of QSPR/QSAR models, whereby the performance of the original model in data description (r2) is compared to that of models built for permuted (randomly shuffled) response, based on the original descriptor pool and the original model building procedure. We compared y-randomization and several variants thereof, using original response, permuted response...
متن کاملIn Silico Multivariate Regressio n Analysis and Validation Studies on Selective MMP-13 Inhibitors
QSAR(Quantitative Structure Activity Relationship) studies were carried out on a set of 72 α-sulfone hydroxamatesas Matrix Metalloproteinase-13 (MMP-13) inhibitors using multiple regression procedure. Outliers were removed based on Relative Error calculation and Extent of Extrapolation. The activity contributions of these compounds were determined from regression equation and the validation pro...
متن کاملQuantitative structure-activity relationship (QSAR) study of CCR2b receptor inhibitors using SW-MLR and GA-MLR approaches
In this paper, the quantitative structure activity-relationship (QSAR) of the CCR2b receptor inhibitors was scrutinized. Firstly, the molecular descriptors were calculated using the Dragon package. Then, the stepwise multiple linear regressions (SW-MLR) and the genetic algorithm multiple linear regressions (GA-MLR) variable selection methods were subsequently employed to select and implement th...
متن کاملA comparative QSAR study of aryl-substituted isobenzofuran-1(3H)-ones inhibitors
A comparative workflow, including linear and non-linear QSAR models, was carried out to evaluate the predictive accuracy of models and predict the inhibition activity of a series of aryl-substituted isobenzofuran-1(3H)-ones. The data set consisted of 34 compounds was classified into the training and test sets, randomly. Molecular descriptors were selected using the genetic algorithm (GA) as a f...
متن کاملQSAR Modeling of COX-2 Inhibitory Activity of Some Dihydropyridine and Hydroquinoline Derivatives Using Multiple Linear Regression (MLR) Method
COX-2 inhibitory activities of some 1,4-dihydropyridine and 5-oxo-1,4,5,6,7,8-hexahydroquinoline derivatives were modeled by quantitative structure–activity relationship (QSAR) using stepwise-multiple linear regression (SW-MLR) method. The built model was robust and predictive with correlation coefficient (R2) of 0.972 and 0.531 for training and test groups, respectively. The quality of the mod...
متن کامل